A timbre space for speech
Abstract
We describe a perceptual space for timbre, define an objective metric that takes perceptual orthogonality into account, and measure the quality of timbre interpolation. We discuss two timbre representations and measure perceptual judgments of each. We determine that a timbre space based on Mel-frequency cepstral coefficients (MFCC) is a good model for perceptual timbre space.
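As a rough illustration of what an MFCC-based timbre space looks like in practice, the sketch below computes an MFCC vector for a single audio frame and linearly interpolates between two such vectors. It is not the authors' procedure: the filter count, coefficient count, Hann window, synthetic test tones, and the Euclidean distance are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import numpy as np

def hz_to_mel(f):
    # Common O'Shaughnessy-style mel formula (one of several conventions).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    freqs = np.linspace(0.0, sample_rate / 2.0, n_fft // 2 + 1)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_filters + 2))
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def mfcc_frame(frame, sample_rate, n_filters=26, n_coeffs=13):
    """MFCC vector of one frame: power spectrum -> mel energies -> log -> DCT-II."""
    windowed = frame * np.hanning(frame.size)
    power = np.abs(np.fft.rfft(windowed)) ** 2
    energies = mel_filterbank(n_filters, frame.size, sample_rate) @ power
    log_energies = np.log(energies + 1e-10)  # small offset avoids log(0)
    n = np.arange(n_filters)
    k = np.arange(n_coeffs)[:, None]
    dct_basis = np.cos(np.pi * k * (n + 0.5) / n_filters)  # unnormalised DCT-II
    return dct_basis @ log_energies

def interpolate_timbre(a, b, t):
    """Linear interpolation between two points in the MFCC space (assumed metric: Euclidean)."""
    return (1.0 - t) * a + t * b

# Two synthetic frames with the same pitch but different spectral content.
sample_rate = 16000
time = np.arange(512) / sample_rate
frame_a = np.sin(2 * np.pi * 220 * time)
frame_b = frame_a + 0.5 * np.sin(2 * np.pi * 2200 * time)

mfcc_a = mfcc_frame(frame_a, sample_rate)
mfcc_b = mfcc_frame(frame_b, sample_rate)
midpoint = interpolate_timbre(mfcc_a, mfcc_b, 0.5)
distance = np.linalg.norm(mfcc_b - mfcc_a)
```

Because the interpolation is linear, the midpoint lies exactly halfway between the two vectors under the Euclidean distance; whether that also sounds halfway between the two timbres is the perceptual question the abstract addresses.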
Similar resources
Designing Sound: Towards a System for Designing Audio Interfaces using Timbre Spaces
The creation of audio interfaces is currently hampered by the difficulty of designing sounds for them. This paper presents a novel system for generating and manipulating non-speech sounds. The system is designed to generate Auditory Icons and Earcons through a common interface. Using a timbre space representation of the sound, it generates output via an FM synthesiser. The timbre space has been...
A System for Manipulating Audio Interfaces Using Timbre Spaces
The creation of audio interfaces is currently hampered by the difficulty of designing sounds for them. This paper presents a novel system for generating and manipulating non-speech sounds. The system is designed to generate Auditory Icons and Earcons through a common interface. It has been developed to make the design of audio interfaces easier. Using a Timbre Space representation of the sound,...
Automatic Annotation of Timbre Variation for Musical Instruments
This paper proposes a preprocessing technique for the automatic transcription of performances produced by a musical instrument (or other sound source) capable of timbre variations. Voice recognition techniques will be exploited to gather information about timbre, then a clustering approach will be used to reduce data cardinality, and, finally, data dimensionality will be further reduced using m...
GMM-PCA based speaker-timbre conversion on full-quality speech
This work studies the GMM-based approach to full-quality speaker timbre conversion. In general, high-quality voice conversion requires accurate spectral envelope estimates, resulting in high-dimensional feature vectors and relatively high computational cost. Aiming to achieve low-dimensional processing, accurate envelope estimates of the speakers are mel-frequency scaled and proj...
Pitch-synchronous Speech Coding Based on Timbre Vectors
A pitch-synchronous method and system for speech coding using timbre vectors is disclosed. On the encoder side, the speech signal is segmented into non-overlapping pitch-synchronous frames, then converted into a pitch-synchronous amplitude spectrum using FFT. Using Laguerre functions, the amplitude spectrum is transformed into a timbre vector. Using vector quantization, each timbre vector is conver...